Bayes estimators for phylogenetic reconstruction

نویسندگان

  • Peter Huggins
  • Wenbin Li
  • David Haws
  • Thomas Friedrich
  • Jinze Liu
  • Ruriko Yoshida
چکیده

Tree reconstruction methods are often judged by their accuracy, measured by how close they get to the true tree. Yet, most reconstruction methods like maximum likelihood (ML) do not explicitly maximize this accuracy. To address this problem, we propose a Bayesian solution. Given tree samples, we propose finding the tree estimate that is closest on average to the samples. This "median" tree is known as the Bayes estimator (BE). The BE literally maximizes posterior expected accuracy, measured in terms of closeness (distance) to the true tree. We discuss a unified framework of BE trees, focusing especially on tree distances that are expressible as squared euclidean distances. Notable examples include Robinson-Foulds (RF) distance, quartet distance, and squared path difference. Using both simulated and real data, we show that BEs can be estimated in practice by hill-climbing. In our simulation, we find that BEs tend to be closer to the true tree, compared with ML and neighbor joining. In particular, the BE under squared path difference tends to perform well in terms of both path difference and RF distances.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Empirical Bayes Estimators with Uncertainty Measures for NEF-QVF Populations

The paper proposes empirical Bayes (EB) estimators for simultaneous estimation of means in the natural exponential family (NEF) with quadratic variance functions (QVF) models. Morris (1982, 1983a) characterized the NEF-QVF distributions which include among others the binomial, Poisson and normal distributions. In addition to the EB estimators, we provide approximations to the MSE’s of t...

متن کامل

Estimation and Reconstruction Based on Left Censored Data from Pareto Model

In this paper, based on a left censored data from the twoparameter Pareto distribution, maximum likelihood and Bayes estimators for the two unknown parameters are obtained. The problem of reconstruction of the past failure times, either point or interval, in the left-censored set-up, is also considered from Bayesian and non-Bayesian approaches. Two numerical examples and a Monte Carlo simulatio...

متن کامل

Classic and Bayes Shrinkage Estimation in Rayleigh Distribution Using a Point Guess Based on Censored Data

Introduction      In classical methods of statistics, the parameter of interest is estimated based on a random sample using natural estimators such as maximum likelihood or unbiased estimators (sample information). In practice,  the researcher has a prior information about the parameter in the form of a point guess value. Information in the guess value is called as nonsample information. Thomp...

متن کامل

Limiting Properties of Empirical Bayes Estimators in a Two-Factor Experiment under Inverse Gaussian Model

The empirical Bayes estimators of treatment effects in a factorial experiment were derived and their asymptotic properties were explored. It was shown that they were asymptotically optimal and the estimator of the scale parameter had a limiting gamma distribution while the estimators of the factor effects had a limiting multivariate normal distribution. A Bootstrap analysis was performed to ill...

متن کامل

Choice of topology estimators in Bayesian phylogenetic analysis.

Wheeler WC and Pickett KM (2008. Topology-Bayes versus clade-Bayes in phylogenetic analysis. Mol Biol Evol. 25:447-453.) discuss two ways of summarizing the posterior probability distribution of a Bayesian phylogenetic analysis, which they refer to as "topology-Bayes" and "clade-Bayes." They claim that the clade-Bayes approach leads to problems such as "exaggerated clade support, inconsistently...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Systematic biology

دوره 60 4  شماره 

صفحات  -

تاریخ انتشار 2011